Changed the parse_run_traces to include last 4 letters of run_id #1880

Open
wants to merge 4 commits into main
Conversation

@Vidit-Ostwal (Contributor)

#1871
Changed the parse_run_traces function to include the last 4 letters of the run_id of each trace, which yields a unique key for every call.
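
A minimal sketch of what this keying change does, using a hypothetical PromptTrace stand-in for the internal trace object (the class name and fields here are illustrative, not the actual ragas internals):

import typing as t
import uuid
from dataclasses import dataclass, field

@dataclass
class PromptTrace:
    # Hypothetical stand-in for the trace object parse_run_traces reads.
    name: str
    run_id: str = field(default_factory=lambda: str(uuid.uuid4()))
    outputs: t.Dict[str, t.Any] = field(default_factory=dict)

def key_prompt_traces(traces: t.List[PromptTrace]) -> t.Dict[str, t.Any]:
    prompt_traces: t.Dict[str, t.Any] = {}
    for prompt_trace in traces:
        output = prompt_trace.outputs.get("output", {})
        output = output[0] if isinstance(output, list) else output
        # Suffixing the prompt name with a 4-character slice of run_id
        # gives each call its own key, so repeated calls to the same
        # prompt no longer overwrite each other.
        prompt_traces[f"{prompt_trace.name}_{prompt_trace.run_id[:4]}"] = output
    return prompt_traces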

@Vidit-Ostwal (Contributor, Author)

Hi @jjmachan, can you help me with this one? All the unit tests have failed because of PR #1879.

@Vidit-Ostwal (Contributor, Author)

The code change was small. From this:

def validate_samples(self, samples: t.List[Sample]) -> t.List[Sample]:
    """Validates that all samples are of the same type."""
    if len(samples) == 0:
        return samples

    first_sample_type = type(self.samples[0])
    if not all(isinstance(sample, first_sample_type) for sample in self.samples):
        raise ValueError("All samples must be of the same type")

    return samples

to this:


def validate_samples(self, samples: t.List[Sample]) -> t.List[Sample]:
    """Validates that all samples are of the same type."""
    if len(samples) == 0:
        return samples

    first_sample_type = type(samples[0])
    for i, sample in enumerate(samples):
        if not isinstance(sample, first_sample_type):
            raise ValueError(f"Sample at index {i} is of type {type(sample)}, expected {first_sample_type}")

    return samples

which keeps the same type-comparison behavior while iterating (now over the samples argument rather than self.samples) and additionally reports the index of the first mismatched sample.
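
For reference, a standalone harness exercising the new behavior, with hypothetical Sample subclasses (A, B) standing in for the real sample types:

import typing as t

class Sample: ...
class A(Sample): ...
class B(Sample): ...

def validate_samples(samples: t.List[Sample]) -> t.List[Sample]:
    """Module-level copy of the method above, for a quick standalone check."""
    if len(samples) == 0:
        return samples
    first_sample_type = type(samples[0])
    for i, sample in enumerate(samples):
        if not isinstance(sample, first_sample_type):
            raise ValueError(f"Sample at index {i} is of type {type(sample)}, expected {first_sample_type}")
    return samples

validate_samples([A(), A()])      # passes: all samples share one type
try:
    validate_samples([A(), B()])  # raises, now naming the offending index
except ValueError as e:
    print(e)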

@Vidit-Ostwal (Contributor, Author)

Hi @jjmachan, does this solution align with the given issue?

@@ -164,7 +164,7 @@ def parse_run_traces(
     prompt_trace = traces[prompt_uuid]
     output = prompt_trace.outputs.get("output", {})
     output = output[0] if isinstance(output, list) else output
-    prompt_traces[f"{prompt_trace.name}"] = {
+    prompt_traces[f"{prompt_trace.name}_{prompt_trace.run_id[:4]}"] = {
@jjmachan (Member)

Suggested change:
-    prompt_traces[f"{prompt_trace.name}_{prompt_trace.run_id[:4]}"] = {
+    prompt_traces[f"{prompt_trace.name}_{i}"] = {

What if we use i instead? Would that be better?

Right now I think it might be a bit more confusing, since users have no way of guessing the run_id. This would make it a bit more predictable. What do you think?

PS: Thanks a lot for taking the time to fix it 🙂

@Vidit-Ostwal (Contributor, Author)

@jjmachan Agreed, the user won't have a clue what the run_id is, so it can be a bit confusing. If we append the {i}, the output would look more like:

{'claim_decomposition_prompt_1': {
     'input': ClaimDecompositionInput(response='Eifel tower is in Paris', sentences=['Eifel tower is in Paris']),
     'output': ClaimDecompositionOutput(decomposed_claims=[['Eiffel tower is in Paris']])},
 'n_l_i_statement_prompt_2': {
     'input': NLIStatementInput(context='Paris, France is a city where Eifel tower is located', statements=['Eiffel tower is in Paris']),
     'output': NLIStatementOutput(statements=[StatementFaithfulnessAnswer(statement='Eiffel tower is in Paris', reason='The context explicitly states that Eiffel tower is located in Paris, France.', verdict=1)])},
 'claim_decomposition_prompt_3': {
     'input': ClaimDecompositionInput(response='Paris, France is a city where Eifel tower is located', sentences=['Paris, France is a city where Eifel tower is located']),
     'output': ClaimDecompositionOutput(decomposed_claims=[['Paris is a city in France.'], ['Eiffel Tower is located in Paris.']])},
 'n_l_i_statement_prompt_4': {
     'input': NLIStatementInput(context='Eifel tower is in Paris', statements=['Paris is a city in France.', 'Eiffel Tower is located in Paris.']),
     'output': NLIStatementOutput(statements=[StatementFaithfulnessAnswer(statement='Paris is a city in France.', reason='The context does not provide any information about the location of Paris.', verdict=0),
                                              StatementFaithfulnessAnswer(statement='Eiffel Tower is located in Paris.', reason='The context explicitly states that the Eiffel Tower is in Paris.', verdict=1)])}}

This solution is more user-centric, since the trailing index simply reflects the sequence of prompt calls within Ragas' reasoning process.

@Vidit-Ostwal (Contributor, Author) commented Feb 2, 2025

Hi @jjmachan, I updated the traces functionality to use {prompt_number}: {trace}.

The new traces should look like:

{'prompt_1': {'name': 'claim_decomposition_prompt',
              'input': ClaimDecompositionInput(response='Eifel tower is in Paris'),
              'output': ClaimDecompositionOutput(claims=['Eifel tower is in Paris.'])},
 'prompt_2': {'name': 'n_l_i_statement_prompt',
              'input': NLIStatementInput(context='Paris, France is a city where Eifel tower is located', statements=['Eifel tower is in Paris.']),
              'output': NLIStatementOutput(statements=[StatementFaithfulnessAnswer(statement='Eifel tower is in Paris.', reason='The context explicitly states that Eifel tower is located in Paris, France.', verdict=1)])},
 'prompt_3': {'name': 'claim_decomposition_prompt',
              'input': ClaimDecompositionInput(response='Paris, France is a city where Eifel tower is located'),
              'output': ClaimDecompositionOutput(claims=['Paris is a city in France.', 'Eiffel Tower is located in Paris.'])},
 'prompt_4': {'name': 'n_l_i_statement_prompt',
              'input': NLIStatementInput(context='Eifel tower is in Paris', statements=['Paris is a city in France.', 'Eiffel Tower is located in Paris.']),
              'output': NLIStatementOutput(statements=[StatementFaithfulnessAnswer(statement='Paris is a city in France.', reason='The context does not provide any information about the location of Paris.', verdict=0),
                                                       StatementFaithfulnessAnswer(statement='Eiffel Tower is located in Paris.', reason='The context explicitly states that the Eiffel Tower is in Paris.', verdict=1)])}}
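
A sketch of this final scheme, reusing the hypothetical PromptTrace stand-in from the earlier sketch: the keys become prompt_1, prompt_2, ... in call order, and the prompt name moves into the value (real traces also carry the prompt input, which the stand-in omits):

def key_traces_by_index(traces: t.List[PromptTrace]) -> t.Dict[str, t.Any]:
    prompt_traces: t.Dict[str, t.Any] = {}
    # Enumerate from 1 so the keys read prompt_1, prompt_2, ... in the
    # order the prompts were called.
    for i, prompt_trace in enumerate(traces, start=1):
        output = prompt_trace.outputs.get("output", {})
        output = output[0] if isinstance(output, list) else output
        prompt_traces[f"prompt_{i}"] = {
            "name": prompt_trace.name,  # the prompt name, no longer the key
            "output": output,
        }
    return prompt_traces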

@Vidit-Ostwal (Contributor, Author)

Hi @sahusiddharth, can you check whether this is because of the previous unit-test PR you made?

@jjmachan (Member) commented Feb 4, 2025

Hey @Vidit-Ostwal, this is great, much more readable.

[image]

But one concern is that this is too big a change and might break things. Can I keep this on pause till v0.3?

We are planning to maybe release that with another feature we wanted; this way we honor semver best.

@sahusiddharth, also roping you in on this.

@Vidit-Ostwal (Contributor, Author)

@jjmachan, sure. I can coordinate with @sahusiddharth as well to resolve some conflicts.

Successfully merging this pull request may close these issues.

Traces in evaluation result are only saving the last prompt per trace name